GeoRep: A Flexible Tool for Spatial Representation of Line Drawings

نویسندگان

  • Ronald W. Ferguson
  • Kenneth D. Forbus
چکیده

A key problem in diagrammatic reasoning is understanding how people reason about qualitative relationships in diagrams. We claim that progress in diagrammatic reasoning is slowed by two problems: (1) researchers tend to start from scratch, creating new spatial reasoners for each new problem area, and (2) constraints from human visual processing are rarely considered. To address these problems, we created GeoRep, a spatial reasoning engine that generates qualitative spatial descriptions from line drawings. GeoRep has been successfully used in several research projects, including cognitive simulation studies of human vision. In this paper, we outline GeoRep’s architecture, explain the domainindependent and domain-specific aspects of its processing, and motivate the representations it produces. We then survey how GeoRep has been used in three different projects–a model of symmetry, a model of understanding juxtaposition diagrams of physical situations, and a system for reasoning about military courses of action. Introduction: How Diagrams Work Diagrams are ubiquitous. In daily communications, through sketches, maps, and figures, people use diagrams to convey information. Some diagrams depict intrinsically spatial domains, such as bus routes or furniture arrangements. Other diagrams use spatial concepts to compactly show more abstract relations, such as corporate hierarchies or data flow in a computer program. In all such domains, diagrams can be extremely effective. It is also true, however, that there is a keen difference between effective and ineffective diagrams. Small visual differences may distinguish a diagram that elucidates from one that confuses (Tufte, 1990). A key difference between good and bad diagrams is how well they utilize the kinds of qualitative spatial relations most easily perceived by the human visual system. In the best diagrams, these spatial relations support the conceptual relations the reader is meant to infer. For example, in a thermodynamics diagram, an arrow may indicate the direction of heat flow, with thicker arrows to indicate greater flow, or tapering arrows to indicate heat dissipation. Or, in a circuit diagram, wires Copyright © 2000, American Association for Artificial Intelligence (www.aaai.org). All rights reserved. may be drawn so that related wires are adjacent and parallel, so they can be visually grouped. For this reason, to understand how diagrams work, we must show how diagrams use visual characteristics to support particular qualitative inferences. In the system described here, we model this process as an interaction between two representation levels: 1. A low-level, domain-independent representation which involves a representative set of primitive spatial relations. This level models human low-level vision. 2. A high-level, domain-specific representation that models visual skills for a particular domain. This level links lowlevel visual relations to a domain’s conceptual content. These two representation levels form the basis of GeoRep. GeoRep is an engine for building diagrammatic reasoners. GeoRep takes as input a line drawing, given as a set of primitive visual elements. From this drawing, GeoRep creates a predicate calculus representation of the drawing's visual relations. To perform this task, GeoRep, given the drawing, examines the primitive shapes in the figure, looking for a broad set of low-level visual relations. These relations are detected by a library of visual operations (assumed to be domain-independent) which partially cover the set of universal visual routines (Ullman, 1984). Next, GeoRep uses these relations, in combination with domain-dependent rules, to generate the second, domain-specific representation. GeoRep's two-level

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Cognitive Approach to Sketch Understanding

Sketching is an interactive process of communication, using drawings and linguistic information to convey spatial and conceptual material. Our work on a computational model of sketching has the goal of both explaining human sketching and creating software that can be a human-like partner in sketching interactions. This focus has lead us to explore a very different set of tradeoffs from those ty...

متن کامل

From Diagrams to Models by Analogical Transfer

We present a method for constructing a teleological model of a drawing of a physical device through analogical transfer of the teleological model of the same device in an almost identical drawing. A source case, in this method, contains both a 2-D vector-graphics line drawing of a physical device and a teleological model of the device called a Drawing-Shape-Structure-Behavior-Function (DSSBF) m...

متن کامل

Qualitative Spatial Interpretation of Course-of-Action Diagrams

This paper demonstrates qualitative spatial reasoning techniques in a real-world diagrammatic reasoning task: Course-of-Action (COA) diagrams. COA diagrams are military planning diagrams that depict unit movements and tasks in a given region. COA diagrams are a useful test bed for researching diagram understanding due to their composable symbology, their intrinsically spatial task, and their us...

متن کامل

A structural representation for the understanding of line drawing images

In this paper, we are concerned with the problem of finding a good and homogeneous representation to encode line drawing documents (which may be handwritten). We propose a method in which the problems induced by a first step skeletonization have been avoided. First, we achieve a vectorization of the image that enables a fine description of the drawing using only vectors and quadrilateral primit...

متن کامل

Core geometry in perspective.

Research on animals, infants, children, and adults provides evidence that distinct cognitive systems underlie navigation and object recognition. Here we examine whether and how these systems interact when children interpret 2D edge-based perspectival line drawings of scenes and objects. Such drawings serve as symbols early in development, and they preserve scene and object geometry from canonic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000